CDS

Accession Number TCMCG075C05416
gbkey CDS
Protein Id XP_007042161.2
Location join(4125455..4125564,4125737..4127324)
Gene LOC18607763
GeneID 18607763
Organism Theobroma cacao
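
The Location field uses a GenBank-style `join(...)` of two inclusive, 1-based intervals. As a minimal sketch (the helper name `cds_length` is hypothetical and not part of any database tooling), the spliced CDS length can be computed from the coordinates and compared with the protein length reported in this record:

```python
import re

def cds_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a GenBank-style location string."""
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    # Coordinates are 1-based and inclusive, so a segment spans end - start + 1 bases.
    return sum(int(end) - int(start) + 1 for start, end in segments)

location = "join(4125455..4125564,4125737..4127324)"
nt = cds_length(location)
codons, remainder = divmod(nt, 3)
# 110 + 1588 = 1698 nt, i.e. 566 codons with no remainder
print(nt, codons, remainder)
```

Under these assumptions, 566 codons minus one stop codon gives 565 residues, which agrees with the Length value in the Protein section.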

Protein

Length 565aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007042099.2
Definition PREDICTED: aureusidin synthase [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Protein of unknown function (DUF_B2219)
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00031, R00045, R02078
KEGG_rclass RC00046, RC00180
BRITE ko00000, ko00001, ko01000
KEGG_ko ko:K00422
EC 1.10.3.1
KEGG_Pathway ko00350, ko00950, ko01100, ko01110, map00350, map00950, map01100, map01110
GOs -

Sequence

CDS:  
ATGGAGAAGAAGAAATGGATTCTTACAGTTGTGTTAGCCCTCATCGTGGCAATGCTGCCTTTGACTTTTCGAATCCTGGAATCGCATCAAGTTCAGCGTTTATATACGGGGGAACTAACAGACATGATACTCGGAAAGCTGGGAGGCCGGGCTTCCAGTAACCACGACCTTAGCACAGACAAGGCTGTAGCGAGCAAATTTATCGCCCCGAACTTAACAGCGTGCCACCCATCATATGGTCGTCCAGATCTTCTTGTTCACTGTTGTCCTCCGGGGTTTGAATCGCCAGTGCCCTTTGTCGATTTCCAGTTTCCTGATCCTCAATCGCCAAAACGTGTGCGCAGGCCAGTCCAACTTGTGGACGAGAACTACATCGCCAAATACAACAAGGCTTTGTCGATCATGAAGTCCTTGCCATACGATGATCCTCGAAGTTTTGCCCGTCAAGCCAACTTGCACTGTCTCTTTTGTACTGGAGCCTACGACCAACAAAACTCCAATACCCCTCTTAGTATTCACAGAACATGGTTATTCTTTCCCTGGCACCGCATGATGATCTACTTCCACGAACGCATCATCGGTAGTCTAATCGGAGATGACACGTTTGCTTTTCCGGTTTGGACTTGGGACATCCCTGAAGGAATGGTGATGCCGGATATTTACGCGAACATGAATTTATCATTCTTTCACAAGGTACGTGACTTTTCACATTTTCCACCGCGGGTGGCAGATTTGAACTACTTCGAGGAAACAAATTTGAGTCCTCAAGAGCAGTTGGATACAAACTTGGCATTTATGTATAACCAAATGGTATCTGGTGCAAAGAAGACGGAATTGTTCATGGGATGCACATATAAAGCCAATGAAGGATATTGTAATTCACCCGGCACTGTAGAGAGTGCCCCTCACAACACTTTGCATACATGGGTAGGGAGCAATCTAGAACCTGGAAGGGAGGATATGGGTAAATTCTACTCAGCAGCAAGAGACCCTATTTTCTATGCACATCATTCCAATATAGATCGTCTTTGGGAAGTTTGGAGGGAGATTCATAAACATGAATTGGATATCAAAGATCCAGATTGGCTAAACTCTTTCTTTTTCTTTTATGATGAGAACTTGAAGCTGGTAAAGATTAAGGTTCGTGATGTTCTTGATATCTCCAAACTTGGTTATTCTTACGAGGAGGTCGATCGTCCGTGGTTGAATAAACGTCCCACGCCTTCAGTTCCGCCAAAGGTAGCCCGTCAGATATTAAAATCGAAAGAGAATGAGAACCAATTCCGACTGTCCTCTGATTTCGGGCCCCATGGTCGAGCTCTAGACGCTAGCTTGACAGTAAAGGTTAACAGGTCTAAAAATCATTTGACCAAGAGGGAGAAAGGAGAGGAAGAGGTCATAGTTGTTCACGGCATTGAAGTGAAAGGGGACGCATATGTTAAGTTTGATGTGTATGTAAACATGGTTGATCAGACGATAATCTCTCCAAAGTCCAGGGAATTCGCGGGGACCTTCGCTCACATTCCTGGGGGTGGGGAGATGATGAAGAGGAAGATCGATCTCAAACTGGGAGTGTCAGAACTATTGGAAGATTTGGAAGCAAAGGAAGATGAAAGCATCTGGGTCACATTGTTGCCAAGGACAGCAAGTTGTAGCAGTGTAACAATTGAAGGAGTACAAATTAAGTATATCAAATAA
Protein:  
MEKKKWILTVVLALIVAMLPLTFRILESHQVQRLYTGELTDMILGKLGGRASSNHDLSTDKAVASKFIAPNLTACHPSYGRPDLLVHCCPPGFESPVPFVDFQFPDPQSPKRVRRPVQLVDENYIAKYNKALSIMKSLPYDDPRSFARQANLHCLFCTGAYDQQNSNTPLSIHRTWLFFPWHRMMIYFHERIIGSLIGDDTFAFPVWTWDIPEGMVMPDIYANMNLSFFHKVRDFSHFPPRVADLNYFEETNLSPQEQLDTNLAFMYNQMVSGAKKTELFMGCTYKANEGYCNSPGTVESAPHNTLHTWVGSNLEPGREDMGKFYSAARDPIFYAHHSNIDRLWEVWREIHKHELDIKDPDWLNSFFFFYDENLKLVKIKVRDVLDISKLGYSYEEVDRPWLNKRPTPSVPPKVARQILKSKENENQFRLSSDFGPHGRALDASLTVKVNRSKNHLTKREKGEEEVIVVHGIEVKGDAYVKFDVYVNMVDQTIISPKSREFAGTFAHIPGGGEMMKRKIDLKLGVSELLEDLEAKEDESIWVTLLPRTASCSSVTIEGVQIKYIK
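
The Protein entry should correspond to a translation of the CDS entry. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the source database. It is exercised only on the first 21 and final 18 bases of the CDS above rather than on the full sequence.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases (for example N) are reported as X.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAGAAGAAGAAATGGATT"))  # first seven codons of the CDS: MEKKKWI
print(translate("ATTAAGTATATCAAATAA"))     # last six codons, including the stop: IKYIK
```

Both fragments match the start (MEKKKWI...) and end (...IKYIK) of the Protein sequence. Passing the full CDS string to `translate` would extend the same comparison to the whole record.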